Near-optimal clustering in the k-machine model

نویسندگان

چکیده

The clustering problem, in its many variants, has numerous applications operations research and computer science (e.g., bioinformatics, image processing, social network analysis, etc.). As sizes of data sets have grown rapidly, researchers focused on designing algorithms for problems models computation suited large-scale such as MapReduce, Pregel, streaming models. k-machine model (Klauck et al., (SODA 2015) [8]) is a simple, message-passing distributed graph processing. This paper considers three the most prominent examples problems: uncapacitated facility location p-median p-center problem presents O(1)-factor approximation these running O˜(n/k) rounds model. These are optimal up to polylogarithmic factors because this also shows Ω˜(n/k) lower bounds obtaining polynomial-factor problems. first results We assume that metric provided input only implicitly provided, an edge-weighted nutshell, our main technical contribution show constant-factor all can be obtained by learning small portion metric.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SVM Venn Machine with k-Means Clustering

In this paper, we introduce a new method of designing Venn Machine taxonomy based on Support Vector Machines and k-means clustering for both binary and multi-class problems. We compare this algorithm to some other multi-probabilistic predictors including SVM Venn Machine with homogeneous intervals and a recently developed algorithm called Venn-ABERS predictor. These algorithms were tested on a ...

متن کامل

Near-Optimal Virtual Machine Packing Based on Resource Requirement of Service Demands Using Pattern Clustering

Upon the expansion of Cloud Computing and the positive outlook of organizations with regard to the movements towards using cloud computing and their expanding utilization of such valuable processing method, as well as the solutions provided by the cloud infrastructure providers with regard to the reduction of the costs of processing resources, the problem of organizing resources in a cloud envi...

متن کامل

Comparing Model-based Versus K-means Clustering for the Planar Shapes

‎In some fields‎, ‎there is an interest in distinguishing different geometrical objects from each other‎. ‎A field of research that studies the objects from a statistical point of view‎, ‎provided they are‎ ‎invariant under translation‎, ‎rotation and scaling effects‎, ‎is known as the statistical shape analysis‎. ‎Having some objects that are registered using key points on the outline...

متن کامل

Almost Optimal Solutions to k-Clustering Problems

Pankaj Kumar1 and Piyush Kumar2 1 Purdue University 2 Florida State University (a) American Cities (b) Lyria bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bcbc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bcbc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc bc...

متن کامل

Optimal clustering for detecting near-native conformations in protein docking.

Clustering is one of the most powerful tools in computational biology. The conventional wisdom is that events that occur in clusters are probably not random. In protein docking, the underlying principle is that clustering occurs because long-range electrostatic and/or desolvation forces steer the proteins to a low free-energy attractor at the binding region. Something similar occurs in the dock...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Theoretical Computer Science

سال: 2022

ISSN: ['1879-2294', '0304-3975']

DOI: https://doi.org/10.1016/j.tcs.2021.11.026